Skip to content

Conversation

@alebedev87
Copy link
Contributor

@alebedev87 alebedev87 commented Apr 15, 2025

This PR adds a check for the service associated with a deleted gateway. The test now waits for the service to be fully removed and logs a warning if it persists beyond a specified timeout. This helps identify delays in deprovisioning of cloud load balancers backing gateway services.

@openshift-ci openshift-ci bot requested review from grzpiotrowski and knobunc April 15, 2025 00:01
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Apr 15, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign frobware for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@alebedev87 alebedev87 changed the title e2e: Signal service deprovisioning problems during Gateway DNS test [WIP] e2e: Signal service deprovisioning problems during Gateway DNS test Apr 15, 2025
@openshift-ci openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Apr 15, 2025
@alebedev87 alebedev87 force-pushed the dnsrecord-listener-update branch from 301e0bc to a079a5a Compare April 15, 2025 09:25
@melvinjoseph86
Copy link

/retest-required

1 similar comment
@melvinjoseph86
Copy link

/retest-required

This adds a check for the service associated with a deleted gateway.
The test now waits for the service to be fully removed and
logs a warning if it persists beyond a specified timeout.
This helps identify delays in deprovisioning of cloud load balancers
backing gateway services.
@alebedev87 alebedev87 force-pushed the dnsrecord-listener-update branch from a079a5a to 849cdc8 Compare May 13, 2025 15:15
@alebedev87 alebedev87 changed the title [WIP] e2e: Signal service deprovisioning problems during Gateway DNS test e2e: Signal service deprovisioning issues during Gateway DNS test May 13, 2025
@openshift-ci openshift-ci bot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label May 13, 2025
@alebedev87 alebedev87 changed the title e2e: Signal service deprovisioning issues during Gateway DNS test NE-2032: e2e: Signal service deprovisioning issues during Gateway DNS test May 13, 2025
@openshift-ci-robot openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label May 13, 2025
@openshift-ci-robot
Copy link
Contributor

openshift-ci-robot commented May 13, 2025

@alebedev87: This pull request references NE-2032 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.20.0" version, but no target version was set.

In response to this:

This PR adds a check for the service associated with a deleted gateway. The test now waits for the service to be fully removed and logs a warning if it persists beyond a specified timeout. This helps identify delays in deprovisioning of cloud load balancers backing gateway services.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@alebedev87
Copy link
Contributor Author

/assign @grzpiotrowski

@alebedev87
Copy link
Contributor Author

/retest

1 similar comment
@alebedev87
Copy link
Contributor Author

/retest

@openshift-bot
Copy link
Contributor

Issues go stale after 90d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle stale

@openshift-ci openshift-ci bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 15, 2025
@alebedev87
Copy link
Contributor Author

/remove-lifecycle stale

@openshift-ci openshift-ci bot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 20, 2025

if err := kclient.Delete(context.TODO(), gateway); err != nil {
t.Errorf("failed to delete gateway %q: %v", gateway.Name, err)
t.Fatalf("Failed to delete gateway %q: %v", gateway.Name, err)
Copy link
Member

@rikatz rikatz Aug 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we care about the same problems of networking here? eg.: do you want to retry the delete also?

Additionally, if you do add this delete to the retry function below, it is worth ignoring the error if the object does not exist, as previous loop may have deleted it.

One more comment, out of this change but above on line 646: I would add a RetryOnConflict for that update/patch, as you may have other controllers (GatewayAPI/OSSM) changing it, the Update may fail with a conflict. It would be good to ignore and retry the update

Edit: OTOH it may lead to a false negative if you try to get a service name that doesn't match the gateway name, as it will be not found


// The load balancer deprovisioning can take some time.
// Signal a long deprovisioning to help distinguish it from DNS management problems.
gtwSvcName := types.NamespacedName{Namespace: "openshift-ingress", Name: "test-gateway-update-openshift-default"}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

IIRC the service name is derived from the Gateway name, so instead of calling it "test-gateway-update-openshift-default" do you want to compose the name from the gateway name? This way in case of some change on some logic/naming it will not break the test

@openshift-merge-robot openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Sep 26, 2025
@openshift-merge-robot
Copy link
Contributor

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

1 similar comment
@openshift-merge-robot
Copy link
Contributor

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Oct 3, 2025

@alebedev87: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/prow/e2e-aws-operator-techpreview 849cdc8 link false /test e2e-aws-operator-techpreview
ci/prow/e2e-hypershift 849cdc8 link true /test e2e-hypershift
ci/prow/e2e-gcp-operator 849cdc8 link true /test e2e-gcp-operator
ci/prow/e2e-azure-operator 849cdc8 link true /test e2e-azure-operator
ci/prow/e2e-aws-ovn-hypershift-conformance 849cdc8 link true /test e2e-aws-ovn-hypershift-conformance
ci/prow/e2e-aws-ovn 849cdc8 link true /test e2e-aws-ovn
ci/prow/e2e-aws-ovn-serial 849cdc8 link true /test e2e-aws-ovn-serial
ci/prow/hypershift-e2e-aks 849cdc8 link true /test hypershift-e2e-aks
ci/prow/e2e-aws-operator 849cdc8 link true /test e2e-aws-operator
ci/prow/e2e-aws-ovn-upgrade 849cdc8 link true /test e2e-aws-ovn-upgrade
ci/prow/e2e-aws-ovn-serial-2of2 849cdc8 link true /test e2e-aws-ovn-serial-2of2
ci/prow/e2e-aws-ovn-serial-1of2 849cdc8 link true /test e2e-aws-ovn-serial-1of2

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants